Fine-Grained Checkpointing in Distributed Object Systems
Abstract
This paper discusses the problems of checkpointing in distributed object systems and presents an algorithm optimally suited to their fine-grained structure. Checkpoint algorithms usually assume nodes or processes as system units, which results in coarse-grained checkpointing. We show that this difference in granularity makes the usual checkpoint algorithms inadequate for distributed object systems. The presented algorithm is adapted to such systems because it requires minimal overhead per object, thread, and message. Only the relevant parts of the system are checkpointed and rolled back. Furthermore, every thread can initiate checkpointing without prior synchronization. No central coordinating instance is needed. The algorithm belongs to the class of coordinated algorithms and therefore freezes parts of the system for a certain period of time. We show that freezing can be reduced to a minimum by overlapping ongoing computations with checkpointing.
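The abstract does not spell out how the "relevant parts" of the system are determined, so the following Python sketch is only an illustration of the general idea, not the paper's algorithm. It assumes, hypothetically, that two objects become mutually dependent once they exchange a message, and that a checkpoint started at one object must cover everything transitively dependent on it. The class `DependencyTracker` and its methods are invented names for this example.

```python
from collections import defaultdict, deque


class DependencyTracker:
    """Hypothetical helper: records which objects have exchanged messages."""

    def __init__(self):
        # Maps each object id to the set of objects it exchanged messages with.
        self._peers = defaultdict(set)

    def record_message(self, sender, receiver):
        # Assumption: a message makes both objects' states mutually dependent.
        self._peers[sender].add(receiver)
        self._peers[receiver].add(sender)

    def checkpoint_set(self, initiator):
        """Return the objects to checkpoint together with `initiator`.

        Breadth-first search over the recorded message dependencies.
        Objects that never communicated with this group are left out.
        """
        seen = {initiator}
        queue = deque([initiator])
        while queue:
            obj = queue.popleft()
            for peer in self._peers.get(obj, ()):
                if peer not in seen:
                    seen.add(peer)
                    queue.append(peer)
        return seen


tracker = DependencyTracker()
tracker.record_message("a", "b")
tracker.record_message("b", "c")
tracker.record_message("x", "y")  # unrelated group, not touched by "a"
group = tracker.checkpoint_set("a")
```

Under these assumptions, a checkpoint initiated at `"a"` covers `"a"`, `"b"`, and `"c"` but leaves `"x"` and `"y"` running, which is one way to read the claim that only the relevant parts are checkpointed and rolled back.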
Similar resources
Exploring Checkpointing and Closed Nesting in Distributed Transactional Memory
Checkpointing and closed nesting are mechanisms typically used for implementing partial roll-back in transactional systems. Closed nesting limits the amount of work to redo on an abort by allowing sub-transactions to abort and retry independently from their parents. Checkpointing goes further and allows a transaction to be rolled back to any previous point where a checkpoint was saved. Checkpoi...
Lightweight write detection and checkpointing for fine-grained persistence
Many systems must dynamically track writes to cached data, for the purpose of reconciling those updates with respect to the permanent or global state of the data. For example, distributed systems employ coherency protocols to ensure a consistent view of shared data. Similarly, database systems log updates both for concurrency control and to ensure the resilience of those updates in the face of ...
An Architecture for Dynamic Reconfiguration in a Distributed Object-Based Programming Language
Distributed applications ideally allow reconfiguration while the application is running, but changes are usually limited to adding new client and server processes and changing the bindings among such processes. In some application domains, such as real-time financial services, it is necessary to support finer grained reconfiguration at the level of entities smaller than processes, but for perfo...
A Study on Fine-Grained Replications of Distributed Java Applications
In distributed object systems, object-oriented (OO) applications are replicated from remote servers to client sites to improve performance, scalability, and availability. This study focuses on fine-grained replications of distributed OO applications. Unlike the traditional replication scheme by which a self-contained application is replicated entirely at once, the fine-grained replication scheme...